{ "cells": [ { "cell_type": "markdown", "metadata": {}, "source": [ "# Technical Note on the Baseline Regression Implementation\n", "\n", "\n", "## The Regression Model\n", "\n", "The general regression problem is written as,\n", "$$\\mathbf{y} = \\mathbf{X}\\beta + \\epsilon\\text{,}$$\n", "\n", "where $\\mathbf{y}$ is the length $n$ vector of observations, $\\beta$ is the length $m$ vector of predictor coefficients, $\\mathbf{X}$ is the $n\\times m$ matrix of predictors, and $\\epsilon$ is the length $n$ vector of residuals. The goal of the regression is to find the values of $\\beta$ which minimize the quantity\n", "$$(\\mathbf{y} - \\mathbf{X}\\beta)^T\\mathbf{\\Omega}^{-1}(\\mathbf{y} -\\mathbf{X}\\beta)\\text{,}$$\n", "\n", "where $\\mathbf{\\Omega}$ is the covariance of the observations. The problem admits a direct solution,\n", "$$\\beta = (\\mathbf{X}^T \\mathbf{\\Omega}^{-1} \\mathbf{X})^{-1} \\mathbf{X}^T \\mathbf{\\Omega}^{-1} \\mathbf{y}\\text{,}$$\n", "\n", "which can also be used to obtain an error estimate of $\\beta$, assuming $\\mathbf{\\Omega}$ is correctly specified.\n", "\n", "The regression is performed using an iterative procedure (Cochrane and Orcutt, 1949) with $\\mathbf{\\Omega}$ set to the identity matrix for the first iteration; the first iteration is therefore equivalent to unweighted ordinary least squares. After the first iteration the autocorrelation coefficient of the residuals, $\\rho$, is calculated as\n", "$$\\rho = \\frac{\\sum_{i=2}^{n} (\\epsilon_i - \\overline{\\epsilon})(\\epsilon_{i-1} - \\overline{\\epsilon})}{\\sum_{i=1}^{n} (\\epsilon_i - \\overline{\\epsilon})^2}\\text{,}$$\n", "\n", "where $\\overline{\\epsilon}$ is the mean value of the residuals. Typical values for the autocorrelation coefficient are $\\sim 0.2-0.3$. For the next iteration the covariance matrix is modified to take the autocorrelation into account (Prais and Winsten, 1954),\n", "$$\\mathbf{\\Omega} = \\begin{bmatrix}\\frac{1}{1-\\rho^2} & \\frac{\\rho}{1-\\rho^2} & \\frac{\\rho^2}{1-\\rho^2} & \\cdots &\\frac{\\rho^{n-1}}{1-\\rho^2} \\\\\\\\[8pt]\\frac{\\rho}{1-\\rho^2} & \\frac{1}{1-\\rho^2} & \\frac{\\rho}{1-\\rho^2} & \\cdots & \\frac{\\rho^{n-2}}{1-\\rho^2} \\\\\\\\[8pt]\\frac{\\rho^2}{1-\\rho^2} & \\frac{\\rho}{1-\\rho^2} & \\frac{1}{1-\\rho^2} & \\cdots & \\frac{\\rho^{n-3}}{1-\\rho^2} \\\\\\\\[8pt]\\vdots & \\vdots & \\vdots & \\ddots & \\vdots \\\\\\\\[8pt]\\frac{\\rho^{n-1}}{1-\\rho^2} & \\frac{\\rho^{n-2}}{1-\\rho^2} & \\frac{\\rho^{n-3}}{1-\\rho^2} & \\cdots & \\frac{1}{1-\\rho^2}\\end{bmatrix}\\text{.}$$\n", "\n", "The covariance matrix is further modified to account for measurement gaps (Savin and White, 1978). This procedure is repeated until the autocorrelation coefficient has converged to within some tolerance level. The final error estimate is calculated by scaling $\\mathbf{\\Omega}$ to match the observed variance of the residuals.\n", "\n",
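"The iterative fit can be sketched compactly with NumPy. The following is only a minimal illustration of the procedure described above, not the actual implementation: the function name `iterative_gls` and its arguments are invented for this note, measurement gaps are ignored, and the covariance inverse is computed by brute force.\n",
"\n",
"```python\n",
"import numpy as np\n",
"\n",
"def iterative_gls(X, y, tol=1e-6, max_iter=50):\n",
"    # Sketch of the iterative fit: X is the (n, m) predictor matrix,\n",
"    # y the length-n observation vector.\n",
"    n = len(y)\n",
"    omega_inv = np.eye(n)   # first iteration: identity covariance, i.e. plain OLS\n",
"    rho_old = 0.0\n",
"    for _ in range(max_iter):\n",
"        # direct GLS solution beta = (X^T W X)^{-1} X^T W y with W = Omega^{-1}\n",
"        xtw = X.T @ omega_inv\n",
"        cov_beta = np.linalg.inv(xtw @ X)\n",
"        beta = cov_beta @ (xtw @ y)\n",
"\n",
"        # lag-one autocorrelation coefficient of the residuals\n",
"        eps = y - X @ beta\n",
"        d = eps - eps.mean()\n",
"        rho = (d[1:] @ d[:-1]) / (d @ d)\n",
"        if abs(rho - rho_old) < tol:\n",
"            break\n",
"        rho_old = rho\n",
"\n",
"        # Prais-Winsten AR(1) covariance: Omega_ij = rho^|i-j| / (1 - rho^2)\n",
"        lags = np.abs(np.arange(n)[:, None] - np.arange(n)[None, :])\n",
"        omega_inv = np.linalg.inv(rho ** lags / (1.0 - rho ** 2))\n",
"\n",
"    # scale the parameter covariance by the generalized residual variance\n",
"    sigma2 = (eps @ omega_inv @ eps) / (n - X.shape[1])\n",
"    return beta, np.sqrt(sigma2 * np.diag(cov_beta)), rho\n",
"```\n",
"\n",
"Here the full $n \\times n$ inverse is taken for clarity; in practice the banded structure of the AR(1) inverse could be exploited instead.\n",
"\n",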
"## Predictors/Configurations\n", "\n", "There are three standard configurations: the piecewise linear trend (PWLT), the independent linear trend (ILT), and the EESC trend configuration. The regression procedure is the same for all configurations; the only difference is the set of predictors used in the model. Common to all of these baseline regression setups is a set of predictors intended to account for the natural variability of the measurements.\n", "\n", "* Two orthogonal components of the QBO from http://www.geo.fu-berlin.de/met/ag/strat/produkte/qbo/qbo.dat which are calculated using principal component analysis\n", "* Solar 10.7 cm flux from http://www.spaceweather.ca/data-donnee/sol_flux/sx-5-mavg-eng.php\n", "* ENSO without lag from http://www.esrl.noaa.gov/psd/enso/mei/table.html\n", "* GISS AOD from https://data.giss.nasa.gov/modelforce/strataer/ which is extended past 2012 by extrapolating the last value\n", "\n", "For convenience all of these predictors have been scaled to have mean 0 and standard deviation 1.\n", "\n", "![Default Predictors](images/predictors_default.png)\n", "Figure 1 Standard predictors used in all regression schemes.\n", "\n", "### Piecewise Linear Trends\n", "\n", "The piecewise linear trend consists of two predictors: the first increases linearly until January 1997 and is zero afterwards, while the second is zero until January 1997 and increases linearly afterwards. The two predictors are both zero in January 1997 and are thus linked together. In addition, an overall constant is added which shifts the value at the inflection point.\n", "\n", "![PWLT Predictors](images/predictors_pwlt.png)\n", "Figure 2 Predictors added for the piecewise linear trend scheme.\n", "\n", "### Independent Linear Trends\n", "\n", "The independent linear trend is created using five predictors:\n", "\n", "* A linear increase until January 1997 and zero afterwards\n", "* Zero until January 2000 and a linear increase afterwards\n", "* Constant until January 1997 and zero afterwards\n", "* Zero until January 2000 and constant afterwards\n", "* Constant between January 1997 and January 2000 and zero elsewhere\n", "\n", "In both the \"pre\" and \"post\" time periods there is both a linear and a constant component, allowing the two trends to be treated almost independently of each other. In the gap period between the two an additional constant is added. Because of the nature of the constants added, the overall fitted time series is likely to contain jumps in both 1997 and 2000 where the respective trends begin and end. A construction sketch for the PWLT and ILT predictors is given below.\n", "\n", "![ILT Predictors](images/predictors_ilt.png)\n", "Figure 3 Predictors added for the independent linear trend scheme.\n", "\n",
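"As an illustration, the PWLT and ILT trend predictors can be constructed from a monthly time axis as follows. This is a sketch under assumptions made for this note: the helper names `pwlt_predictors` and `ilt_predictors`, the use of fractional years, and the example date range do not come from the actual code, and the standardized natural-variability predictors of Figure 1 would be appended as additional columns of $\\mathbf{X}$.\n",
"\n",
"```python\n",
"import numpy as np\n",
"\n",
"def pwlt_predictors(years):\n",
"    # years: fractional years, e.g. 1984.042 for mid-January 1984\n",
"    pre = np.where(years < 1997.0, years - 1997.0, 0.0)    # rises linearly to zero at Jan 1997\n",
"    post = np.where(years >= 1997.0, years - 1997.0, 0.0)  # zero, then rises linearly after Jan 1997\n",
"    const = np.ones_like(years)                            # overall constant at the inflection point\n",
"    return np.column_stack([pre, post, const])\n",
"\n",
"def ilt_predictors(years):\n",
"    pre_lin = np.where(years < 1997.0, years - 1997.0, 0.0)    # linear until Jan 1997, zero after\n",
"    post_lin = np.where(years >= 2000.0, years - 2000.0, 0.0)  # zero until Jan 2000, linear after\n",
"    pre_const = (years < 1997.0).astype(float)                 # constant before Jan 1997\n",
"    post_const = (years >= 2000.0).astype(float)               # constant after Jan 2000\n",
"    gap_const = ((years >= 1997.0) & (years < 2000.0)).astype(float)  # constant in the gap\n",
"    return np.column_stack([pre_lin, post_lin, pre_const, post_const, gap_const])\n",
"\n",
"# example monthly grid, here January 1984 through December 2016\n",
"years = 1984.0 + (np.arange(12 * 33) + 0.5) / 12.0\n",
"X_pwlt = pwlt_predictors(years)  # (n, 3) trend block for the PWLT scheme\n",
"X_ilt = ilt_predictors(years)    # (n, 5) trend block for the ILT scheme\n",
"```\n",
"\n",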
"### Two Orthogonal EESC Trends\n", "\n", "Here the overall trend in ozone is accounted for by including two orthogonal components of the EESC. The two components are calculated using principal component analysis on the EESC for different mean ages of air.\n", "\n", "![EESC Predictors](images/predictors_eesc.png)\n", "Figure 4 Predictors added for the EESC trend scheme.\n", "\n", "## References\n", "\n", "Cochrane, D., & Orcutt, G. H. (1949). Application of least squares regression to relationships containing auto-correlated error terms. Journal of the American Statistical Association, 44(245), 32-61.\n", "\n", "Prais, S. J., & Winsten, C. B. (1954). Trend estimators and serial correlation (Vol. 383, pp. 1-26). Chicago: Cowles Commission Discussion Paper.\n", "\n", "Savin, N. E., & White, K. J. (1978). Testing for autocorrelation with missing observations. Econometrica, 46(1), 59." ] }, { "cell_type": "code", "execution_count": null, "metadata": {}, "outputs": [], "source": [] } ], "metadata": { "celltoolbar": "Edit Metadata", "kernelspec": { "display_name": "Python 3", "language": "python", "name": "python3" }, "language_info": { "codemirror_mode": { "name": "ipython", "version": 3 }, "file_extension": ".py", "mimetype": "text/x-python", "name": "python", "nbconvert_exporter": "python", "pygments_lexer": "ipython3", "version": "3.6.4" } }, "nbformat": 4, "nbformat_minor": 2 }